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Abstract 

Smale’s 17th problem asks for an algorithm which finds an approximate 
zero of polynomial systems in average polynomial time (see Smale [17]). 

The main progress on Smale’s problem is Beltran-Pardo [6] and Biirgisser- 
Cucker [9]. In this paper we will improve on both approaches and we prove 
an important intermediate result. Our main results are Theorem 1 on the 
complexity of a randomized algorithm which improves the result of [6], 
Theorem 2 on the average of the condition number of polynomial systems 
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which improves the estimate found in [9], and Theorem 3 on the complexity 
of finding a single zero of polynomial systems. This last Theorem is the 
main result of [9]. We give a proof of it relying only on homotopy methods, 
thus removing the need for the elimination theory methods used in [9]. We 
build on methods developed in Armentano et al. [2], 


1 Introduction 

Homotopy or continuation methods to solve a problem which might depend on 
parameters start with a problem instance and known solution and try to continue 
the solution along a path in parameter space ending at the problem we wish to 
solve. We recall how this works for the solutions of polynomial systems using a 
variant of Newton’s method to accomplish the continuation. 

Let Hd = be the complex vector space of degree d complex homogeneous 
polynomials in n + 1 variables. For a = {ao,... ,an) G ~ 

and the monomial = Zq° ■ ■ ■ z^", the Weyl Hermitian structure on Tid makes 
(2:", z^) := 0, for a 7^ /9 and 



Now for (d) = (di,..., d„) we let 'H(d) = 11^=1 '^d^- This is a complex vector 
space of dimension 

i=l ^ ^ 

That is, N is the size of a system / G 'H(d), understood as the number of complex 
numbers needed to describe /. 

We endow 'H(d) with the product Hermitian structure 

n 

if, 9 ) ■= '^{fh9i), 

k=l 

where / = (/i,..., fn), and g = {gi,, g^). This Hermitian structure is some¬ 
times called the Weyl, Bombieri-Weyl, or Kostlan Hermitian structure. It is 
invariant under unitary substitution / 1—)■ / o where U is a. unitary transfor¬ 
mation of (see Blum et al. [8, p. 118] for example). 

On we consider the usual Hermitian structure 

n 

{x,y) := '^XkW- 

k=0 

Given 0 ^ ( E let C"*" denotes Hermitian complement of (, 

G:={t«eC”+‘ : (t>,C> = 0}. 


2 



The subspace C"*" is a model for the tangent space, of the projective 

space at the equivalence class of ( (which we also denote by (). The 

space T(^P(C”^’'‘^) inherits an Hermitian structure from (•, •) given by 


{v,w)c 


{v,w) 

(C,C)' 


The group of unitary transformations U acts naturally on by 
for U eV, and the Hermitian structure of is invariant under this action. 

A zero of the system of equations / is a point x G such that fi{x) = 0, 
i = 1,... ,n. If we think of / as a mapping / : —)■ C”, it is a point x such 

that f{x) = 0. 

For a generic system (that is, for a Zariski open set of / G 'H{d))i Bezout’s the¬ 
orem states that the set of zeros consist of V ;= 111=1 complex lines through 0. 
These T> lines are V points in projective space P(C"’''^). So our goal will be to 
approximate one of these points, and we will use the so-called homotopy or con¬ 
tinuation methods. 

These methods for the solution of a system / G 'H(d) proceed as follows. 
Choose g G 'H(d) and a zero C ^ P(C”’''^) of g. Connect to / by a path ft, 
0 < t < 1, in 'H(d) such that /o = g, fi = f, and try to continue Co = C to Ct such 
that ftiCt) = 0 , so that /i(Ci) = 0 (see Beltran-Shub [7] for details or [10] for a 
complete discussion). 

So homotopy methods numerically approximate the path {ft, Ct)- One way to 
accomplish the approximation is via (projective) Newton’s methods. Given an 
approximation xt to Ct, dehne 


Xt+At := Nf^^^^{xt), 

where for h G 'H(rf) and y G P(C”’''^) we dehne the projective Newton’s method 
Nh{y) following [14]: 

Nh{y) ■=y- {Dh{y)\y±)-^h{y). 

Note that is dehned on P(C”’''^) at those points where Dh{y)\y± is invertible. 

That Xt is an approximate zero of ft with associated (exact) zero Ct means that 
the sequence of Newton iterations N^^{xt) converges immediately and quadrati- 
cally to Ct- 

Let us assume that {/t}te[o,i] is a path in the sphere S('H((t)) := {h G 'H(d) : 
||/t|| = 1}. The main result of Shub [13]^ is that the Ntk may be chosen so that 
to = 0, ffc = tk-i + Atk for k = 1,..., K with tx = 1, such that for all k, Xt^ is 
an approximate zero of ft^. with associated zero Ct*., and the number K of steps 
can be bounded as follows: 

K = K{f,g,0<CD'‘/^ ( 1 . 1 ) 

Jo 

^In Shub [13] the theorem is actually proven in the projective space instead of the sphere, 
which is sharper, but we only use the sphere version in this paper. 
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Here C* is a universal constant, D = maxidi, 




ll/ll ||P/(C)lc^) MiagdlCf’ 

oo 


if Df{()\(^± is invertible 
otherwise 


is the condition number of / G 'H(d) at ( E P(C"'+^), and 


is the norm of the tangent vector to the curve in (ft, (t)- The result in [13] is not 
fully constructive, but specihc constructions have been given, see [3] and [12], and 
even programmed [4], These constructions are similar to those given in Shub- 
Smale [16] and Armentano et ah [2] (this last, for the eigenvalue-eigenvector 
problem case). 

The right-hand side in expression (1.1) is known as the condition length of 
the path (ft,Ct)- We will call (1.1) the condition length estimate of the number 
of steps. 

Taking derivatives w.r.t. t in the equality ft(Ct) = 0 it is easily seen that 

Ct = (Dft(Ct)ktr"kCt), ( 1 . 2 ) 

and with some work (see [8, Lemma 12, p. 231] one can prove that 


\\Ch<MXt)\\ft 


It is known that /i(/, C) > 1; e,g., see [10, Prop. 16.19]. So the estimate (1.1) 
may be bounded from above by 

K(f,g,Q<C'D'‘/^ /V"(/i.Ci)ll/<ll*, (1.3) 

Jo 

where C = \/2C. Let us call this estimate the estimate. 

The condition length estimate is better than the /i^-estimate, but algorithms 
achieving the smaller number of steps are more subtle and the proofs of correct¬ 
ness more difficult. 

Indeed in Beltran-Pardo [5] and Biirgisser-Cucker [9] the authors rely on the 
/r^-estimate. At the times of these papers the algorithms achieving the condi¬ 
tion length bound where in development, and [9] includes a construction which 
achieves the /i^-estimate. 

Yet, in a random situation, one might expect the improvement to be similar to 
the improvement given by the average of ||A(a;)||, in all possible directions, com¬ 
pared with ||A|| (here. A: C"" —)■ C"" denotes a linear operator), which according 
to Armentano [1] should give an improvement by a factor of the square root of 
the domain dimension. We have accomplished this for the eigenvalue-eigenvector 
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problem in Armentano et al. [2]. Here we use an argument similar to that of [2] 
to improve the estimate for the randomized algorithm in Beltran-Pardo [6]. 

The Beltran-Pardo randomized algorithm works as follows (see Beltran- 
Pardo [6], and also Biirgisser-Cucker [9]): on input / G 'H(d), 

1. Choose /o at random and then a zero Co of /o at random. Beltran and 
Pardo [6] describe a general scheme to do so (roughly speaking, one hrst 
draws the “linear” part of /o, computes Co from it, and then draws the 
“nonlinear” part of /o). An efficient implementation of this scheme, having 
running time 0{nDN), is fully described and analyzed in [10, Section 17.6]. 

2. Then connect /o/||/o|| to //||/|| by an arc of a great circle in the sphere, 
and invoke the continuation strategy above. 

The main result of [6] is that the average number of steps of this procedure is 
bounded by 0 {D^^‘^nN), and its total average complexity is then 0 {D^^‘^nN‘^) 
(since the cost of an iteration of Newton’s method, assuming all di > 2, is 0{N), 
see [10, Proposition 16.32]). 

Our hrst main result is the following improvement of this last bound. 

Theorem 1 (Randomized algorithm) The average number of steps of the 
randomized algorithm with the condition length estimate is bounded by 


where C is a universal constant. 

The constant C can be taken as with C not more than 400 even ac¬ 
counting for input and round-off error, cf. Dedieu-Malajovich-Shub [12]. 

Remark 1 Theorem 1 is an improvement by a factor of 1 /A^^A of the bound 
in [6], which results from using the condition length estimate in place of the 
/i^-estimate. 

Before proceeding with the proof of Theorem 1, we introduce some useful 
notation. We dehne the solution variety 

V:={(/,C)e'H(d)xP(C“+‘)|/(C)=0}, 

and consider the projections 



V-id) P(C’^+i). 
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The set of ill-posed pairs is the subset 

S' := {(/,C) e V I Df{C)\^± is not invertible} = {(/,C) e V | /i(/,C) = 00 } 

and its projection S := 7ri(S') is the set of ill-posed systems. The number of 
iterations of the homotopy algorithm, K{f, g, (), is hnite if and only if the lifting 
{(/oCi)}te[o,i] of the segment {ft}te[o,i] does not cut S'. 


2 Proof of Theorem 1 


2.1 Preliminaries 


Let us start this section with a few general facts we will use from Gaussian 
measures. 

Given a hnite dimensional real vector space V of dimension m, with an inner 
product, we dehne two natural objects. 

• The unit sphere S(G) with the induced Riemannian structure and volume 
form: the volume of S(R) is ^7^- 


The Gaussian measure centered at c G R, with variance ^ > 0, whose 
density is 

1 II., ..||2/„2 


0-m^ml2 


^ — \\x—c\\^/o 


(2.5) 


We will denote by iVy(c, a^Id) the density given in (2.5). We will skip the 
notation of the underlying space when it is understood. Furthermore, we will 
denote by Exeu fhe average in the case a = 1 (that is, variance 1/2). 


Lemma 2 If(p: V ^ [0, +cx)] is measurable and homogeneous of degree p > —m, 
then 

Y ( m+p \ 

E(V^(:^)) = ^^ E 

x€V 1 (—) «GS(y) 


where 


2 

E I 

ne§{v) vol(§(R)) Js(y) 


ip{u) du. 


Proof. Integrating in polar coordinates we have 


E {(p{x)) 
xev 


j^ml2 

1 


(p{x) e ''"''I' dx 


'xev 

p+oo 


n 


.m/2 
m-\-p > 


p‘ 


m+p-l -p^ 


dp ■ 


r( 2 


JueS{v) 

Y ^ rri+p \ 


2Ti'rn/2 


I cp{u) du = 
Ge§(y) J- 


p>{u) du 


E {p>{u)) 
ues{v) 
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□ 


where we have used that dp = |r(^). 

The next results follows immediately from Fubini’s theorem. 

Lemma 3 Let E he a linear subspace ofV, and let H : V ^ E be the orthogonal 
projection. Then, for any integrable function ip : E ^ W and for any c E V, 
a > 0, we have 


E (^(n(a;))) = E ^y)). 

X'^Ny{c, a'^ld) y'^W£;(n(c),cr^Id) 


□ 


When is a hnite dimensional Hermitian vector space of complex dimen¬ 
sion m, then the complex Gaussian measure on V with variance is defined by 
the real Gaussian measure with variance of the 2m-dimensional real vector 
space associated to V with inner product the real part of the Hermitian product. 

In this fashion, for any hxed g G TL^d) and a > 0, the Hermitian space 
('H(d), (•, •)) is equipped with the complex Gaussian measure N{g,add). The 
expected value of a function 0 : 'H(d) —)■ IF with respect to this measure is given 
by 


E (0) 

/~W(g,<72ld) 


^2N^N 


’fen^d) 


0(/)e 




df. 


( 2 . 6 ) 


Fix any ( G P(C”’''^). Following [10, Sect. 16.3], the space 'H(d) is orthogonally 
decomposed into the sum © V^, where 


c< = {d“>g(y^)“^ “ec”}. 

and 

V< = Xj-‘(C) = {/£«,.,: /(O = 0}. 

Note that and are linear subspaces of 'H(d) of respective (complex) dimen¬ 
sions n and N — n. Note also that 


/o = /-d.a«(|Gf)/(0 

is the orthogonal projection n^(/) of / onto the hber V,;. This follows from the 
reproducing kernel property of the Weyl Hermitian product on "H^., namely, 

(gA-xr^) = 9(0, (2.7) 

for all g G and i = 1,... ,n. In particular, the norm of (•, G 77^. is equal 
to IlCfb 
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2.2 Average condition numbers 


In this section we revisit the average value of the operator and Frobenius condition 
numbers on 'H(d). The Frobenius condition number oi f at ( is given by 


lMf,() := ll/ll (£'/(C)lc4''diag(|ICII*-‘<i''''' 


( 2 . 8 ) 


that is, /ii? is dehned as /x but using Frobenius instead of operator norm. Note 
that /i < /iF < y/nfi. 

Given / G 'H(rf) \ S, the average of the condition numbers over the hber is 


C:/(C)=0 CT(C)=0 


(or cxD if / G S). For simplicity, in what follows we write § := §('H(d)). 

Estimates on the probability distribution of the condition number /i are known 
since [15]. The exact expected value of /iav(/) when / is in the sphere § was found 
in [6] and the following estimate for the expected value of /iav(/) when / is non- 
centered Gaussian was proved in [9]: for all / G 'H(d) and all a > 0, 


E 

/~Ar(/,f72id) 



^ e(n+l) 
- 2a2 


(2.9) 


The following result slightly improves (2.9), even though it is computed for fip- 


Theorem 2 (Average condition number) For every f G 'H(d) ^nd a > 0, 


/^F,av(/) / n 

wfr - 

and equality holds in the centered case. 


Remark 4 The equality of Theorem 2 implies from Lemma 2 with p = —2 that 

E hF,av(/) = (^^-l)^- 
/ 6 § 

Remark 5 In the proof of Theorem 2 we use the double-hbration technology, 
a strategy based on the use of the classical Goarea Formula, see for example 
[8, p. 241]. In order to integrate some real-valued function over FL^d:) whose 
value at some point / is an average over the hber 7rf^(/), we lift it to V and 
then pushforward to P(C”’''^) using the projections given in (1.4). The original 
expected value in ^{(^d) is then writen as an integral over P(C”’'“^) which involves 
the quotient of normal Jacobians of the projections Fi and 712 - More precisely. 




I ^ 0(/, C) df = 

C:/(C)=0 


'CeP(C"+i) 


[ 0(/,C)^^(/,C)c?7r2^(C)dC, 

^(yOevryhC) 

( 2 . 10 ) 
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where 


j^(/,C) = |det(D/(C)lc.)P 

(see [8, Section 13.2], [10, Section 17.3], or [2, Theorem 6.2] for further details 
and other examples of use). 

We point out that the proof of Theorem 2 can also be achieved using the 
(slightly) different method of [6] and [10, Chapter 18] based on the mapping 
taking (/, C) to {Df {(),() whose Jacobian is known to be constant (see [6, Main 
Lemma]). 


Proof of Theorem 2. By the dehnition of non-centered Gaussian, and 
the double-hbration formula (2.10), we have 


E 

/~Af(/;<72id) 


/^F,av(/) 

wfr 


]_ f ( C) \ 

W:/k=0 ^ 


-II/-/P 

e ^ 


^2N^N 


df 


( 2 . 11 ) 


1 1 f r /i^(/,C) 

T> 4gP(Cn+i) ^ i/evc ll/IP 


det(Zl/(C)lc^ 


2 e 




((T^Tt)^ 


dfdC, 


where we have used that ||/ — /|p = ||/ — n(^(/)|p -|- ||/ — n^(/)|p for every 

/e Vc. 

We simplify now the integral over the hber V,j. Let be a unitary trans¬ 
formation of such that f7(j(C/IICII) = ^o- Then, by the invariance under 
unitary substitution of each term under the integral sign, we have by the change 
of variable formula that 


Ir : = 






|2 e 


'/6Vc 


det(/l/(C)lc^r ^;(^_,)^^_, d/ 


HF{h,eo) 


ihev, 


eo 


det(iJh(eo)| 


—- 


i 2 e 

Sjf I 0 -2{N—n)^N—n 


dh, 


where := f oU^^. We project now h G Ve^ orthogonally onto the vector space 


Leo ■= {9 e 'H(d) ■ fi'(eo) = 0, D^g^eo) = 0 for /c > 2}, 


obtaining g G Leo- Since Dh{ef))\^^ coincides with Dg{eo)\^^ (see for example [10, 
Prop. 16.16]), we conclude by Fubini that 


k = 


'/leVe, 


-||>.-neo(fe^)E 

LF^k ek I |2 ^ dg 




' qGLsq 


det(D5((eo)|, 


i2 e 


-- 




dh, 
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where := ni;,g(/i^). By the change of variable given by 

have = ||A“^|||, and denoting by A(^ the image of we obtain that 


we 


h = E ||A ^11^ I det(A)|^ 

A&N(Ac_,ctHA„) 


We thus conclude from (2.11) that 


E 

f^Ndand) 


hF,av(/) 


1 1 
J) (j2n.^n 


-ll/-n^(/)||-^ 


'CeP(C"+i) 


kdC 


( 2 . 12 ) 


If we replace /iF,av(/)^/||/|P by the constant function 1 on 'H(rf), the same argu¬ 
ment leading to (2.12) now leads to 


1 = 


1 1 
T) 




'CeP(C"+i) 


E I det(24)p dC- (2-13) 

AeAf(A<;,(T2ld„) 


From Proposition 7.1 of Armentano et ah [2], we can bound 


Ic = E 

Aev(d^,(72id„) 


t-l||2 


Hid 


n 


det(A)| < — E |det(A)|" 
A&N{d,andn) 


(2.14) 


(with equality if Aq = 0). By combining (2.12), (2.14), and (2.13) we obtain 

FF,av(/) , n 


E 

/~W(/,a2ld) 


< 

2 - ^2 


Moreover, equality holds if / = 0 and hence = 0 for all (. 


□ 


2.3 Complexity of the randomized algorithm 

The goal of this section is to prove Theorem 1. To do so, we begin with some 
preliminaries. 

For / G § we denote by T/§ the tangent space at / of S. This space is 
equipped with the real part of the Hermitian structure of 'H(d), and coincides 
with the (real) orthogonal complement of / G 'H(d)- 

We consider the map 0: S x —t [0, cx)] defined for / ^ S by 

0(/,/):=^ h(/,C)||(/,0ll’ 

C:/(C)=0 
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where ( = {Df{()\(^±) ^f{C), and by 0(/, /) := cxd if / G S. Note that 0 satishes 

0(/, ^/) = /) for ^ > 0. 

Suppose that /o,/ G S are such that /o 7 ^ ±/ and denote by Cf^j the 
shorter great circle segment with endpoints /o and /. Moreover, let a = (i§(/o, /) 
denote the angle between /q and /. If [0,1] —)■ S, f 1 —)■ /t is the constant speed 
parametrization of with endpoints /o and /i = /, then ||/t|| = a. We may 
also parametrize by the arc-length s = at, setting Fg := fa-^s, fo which case 
Fg = a~^ft is the unit tangent vector (in the direction of the parametrization) to 
Cfgj at Fg. Moreover, 

[ 0(/t, ft) dt= [ (t){Fg, Fg) ds. 

Jo Jo 

Consider the compact submanifold iS of § x S given by 

5 = {(/,/) G§x§:/gT^§}, 

where T/§ denotes the real tangent space of § at /. We endow § with the 
Riemannian metric induced from the real part of the Hermitian product of 'H(d), 
and therefore S inherits the product Riemannian structure. 

The following lemma has been proven in Armentano et ah [ 2 ], 


Lemma 6 We have 


I<f> E 

/o,/e§ 


0(/t, ft) dt 


^ E {fifj)), 


where the expectation on the right hand-side refers to the uniform distribution 
on S. 


We proceed with a further auxiliary result. For / G § we consider the unit 
sphere Sf := {/ G Tj§ : (/, /) G 5} in Tj§. 

Lemma 7 Fix / G S and ( G with f{() = 0. For f G Sf let ( be the 

function of (/,/) and ( given by ( = {Df{()\^±)~^f{(). Then we have 

E (iicir") = ^ll(c/(c)k.r‘ii^. 

/ 6 < 5 / - 2 

where the expectation is with respect to the uniform probability distribution on 

‘5/. 

Proof. Since the map TfS —)■ M, / 1 —)■ ||C(/)|P is quadratic, we get from 
Lemma 2 (recall that dimTjS = 2N — 1) 

E (||C(/)ir)= E (||C(/)ir)- 
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Note that the mapping 'H(d) —)■ Cq given by / i—)■ is an orthogonal projection, 

and furthermore Cc_ C” given by / e-)■ /(C) is a linear isometry. Then from 
Lemma 3, and the change of variables formula we obtain 

E (||c(/)ir)= E ||(D/(oi(n-‘®it = ||(D/(oicn-‘||^. 

j&TfS weC^ 

where the last equality is straightforward. □ 

Proof of Theorem 1. From (1.1), using the notation from there, we know 
that the number of Newton steps of the homotopy with starting pair (/o, Co) and 
target system / is bounded as 

A'(/./o,Co) < f Kf„<;,)\\(J„Q)\\dt. 

Jo 

Hence we get for /, /o G §, 

1 51 A'(/,/o, Co) < CDS/" /l ^ a(/„C.)IK/..C.)II* 

Co:/o(Co)=0 ^Co:/o(Co)=0 

= CD^/^ [ 

Jo 

Therefore, by Lemma 6, 


E 

/,/oeS 


(/ E /f(/./o.Co)) < CDS/"f 

Co:/o(Co)=0 


E (/(/,/)). 
(/J)6<S 


(2.15) 


It is easy to check that the projection 5 —)■ §, (/,/) e-)■ /, has the Normal 
Jacobian l/\/2- From the coarea formula, we therefore obtain 


E (/(/,/))= y2E E (/(/,/)) 
(/J)6<s /es feSf 


= V2E 

/e§ 



In order to estimate this last quantity, note hrst that from the Cauchy-Schwartz 
inequality 


E ((1+iicir)^) < 

/e<S/ 

< 


1+ E (iicir) 

^ /e<S/ ^ 

/I \ 
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the last by Lemma 7. Now we use ||(-D/(C)lc^) ^II-F < hF(/, C) and /i(/, C) < 
fipifX) to deduce 


< E |i ^ 


/eS VP 


C:/(C)=0 


(iV-i). 


(^-7)^ , ^ ^/^F,av(/) 


+ E 


2 /e§V(iV-i)i 
the second inequality since for all a; > 0 and a > 0 we have 


X 


1/2 


(1 + < - \- ax. 


2 a 


A call to Remark 4 hnally yields 


^ E W/./)) < + < W(l + 

v2(/,/)e5 2 (iV-i )2 V2 

Replacing this bound in (2.15) hnishes the proof. 


n 


□ 


3 A Deterministic Algorithm 

A deterministic solution for Smale’s 17th problem is yet to be found. The state 
of the art for this theme is given in [9] where the following result is proven. 

Theorem 3 There is a deterministic real-number algorithm that on input f G 
'H(d) computes an approximate zero of f in average time Moreover, 

if we restrict data to polynomials satisfying 

D < ni+e or D > 

for some hxed e > 0, then the average time of the algorithm is polynomial in the 
input size N. 

The algorithm exhibited in [9] uses two algorithmic strategies according to 
whether D < n or D > n. In the hrst case, it applies a homotopy method and in 
the second an adaptation of a method coming from symbolic computation. 

The goal of this section is to show that a more unihed approach, where ho¬ 
motopy methods are used in both cases, yields a proof of Theorem 3 as well. 
Besides a gain in expositional simplicity, this approach can claim for it the well- 
established numerical stability of homotopy methods. 
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In all what follows we assume the simpler homotopy algorithm in [9] (as 
opposed to those in [3, 12]). Its choice of step length at the kth. iteration is 
proportional to (which, in turn, is proportional to /i~^(/ 4 , Ctfc))- 

this algorithm, we have the /x^-estimate (1.3) but not the hner estimate (1.1). 

To understand the technical requirements of the analysis of a deterministic 
algorithm, let us summarize an analysis (simpler than the one in the preceding 
section because of the assumption above) for the randomized algorithm. Recall, 
the latter draws an initial pair ( 5 ',C) from a distribution which amounts to hrst 
draw g from the distribution on § and then draw C uniformly among the T> 
zeros • • •; of 9 - The /x^-estimate (1.3) provides an upper bound for 
the number of steps needed to continue C to a zero of / following the great circle 
from g to f (assuming ||/|| = || 5 f|| = 1 and / 7 ^ ± 5 ^). Now (1.3) does not change 
if we reparametrize {/t}te[o,i] by arc-length, so we can also write it as 

rdsiaJ) 

Jo 


where ds{g, f) is the spherical distance from g to f. Thus, the average number 
of homotopy iterations satisfies 


1 


V 


I rds{9,f) 




2=1 


rMgJ) 

< C EE/ 

f&Sg&S Jq 


9 'F,^Ms)ds. (3.16) 


Let Pg denote the set of pairs (/,(?) G such that ds{g,f) > s. Rewriting the 
above integral using Fubini, we get 


E E 

/SS gSS 


"dsigj) 


2 

9'F,a.v 


ifs) ds 



n 


FF,av(/s) dfdgds = - E 
2 hes 


Xh), 


the second equality holding since for a hxed s E [ 0 , 7 r] and uniformly distributed 
{f,g) G Ps, one can show that the system /* is uniformly distributed on §. 
Summarizing, we get 


1 ^ 

E E 
/eSgeS P 


a 




t^^/9 


= C -l)n 

Rmk. 4 2 


This constitutes an elegant derivation of the previous 0{nD^f‘^N) bound (but 
not of the sharper bound of our Theorem 1). 

Proof of Theorem 3. If the initial pair {g, () is not going to be random we 
face two difficulties. Firstly —as g is not random— the intermediate systems ft 
are not going to be uniformly distributed on S. Secondly —as ( is not random— 
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we will need a bound on a given (t) rather than one on the mean of these 

quantities (over the V possible zeros of ft), as provided by Theorem 2. 

Consider a fixed initial pair {g, () with g E S and let si be the step length of 
the hrst step of the algorithm (see for example the dehnition of Algorithm ALH 
in [9]), which satishes 


c 


(c a constant). 


(3.17) 


Note that this bound on the length si of the first homotopy step depends on the 
condition fi{g, Q only and is thus independent of the condition at the other zeros 
of g. Any of the mentioned versions of the continuation algorithm, not only that 
of [9], satisfies (3.17). 

Consider also the (small) portion of great circle contained in § with endpoints 
g and //||/|1, which we parametrize by arc length and call hg (that is, ho = g 
and ha = //||/|| where a = ds{g, f/\\f\\)) dehned for s G [0,q;]. Thus, after the 
hrst step of the homotopy, the current pair is {hs^,xi) and we denote by C,' the 
zero of hsi associated to Xi. For a time to come, we will focus on bounding the 
quantity 

1 ® 

H:=H{g,Q.= E :=r A-(//||/|l, fc,., («), 

where the sum is over all the zeros of hsj. This is the average of the number 
of homotopy steps over both the system / and the V zeros of We will be 
interested in this average even though we will not consider algorithms following a 
path randomly chosen: the homotopy starts at the pair {g,C), moves to (hsj,a;i) 
and proceeds following this path. 

From (1.3) applied to {hsj^,C'), 


A’(//ll/ll.< C'D^/^ r ^L^h„(P)\\h,\\ds, (3.18) 

J Si 


Reparametrizing {hg : <si < s < a} by {/t/||/t|| : < t < 1} where ft = 

(1 — t)g + tf and ti is such that /ti/||/ii|| = hg^ (see Lemma 8 below) does not 
change the value of the path integral in (3.18). Moreover, a simple computation 
shows that 


d ( ft \ 
dt\\\ft\\J 


< 



u 


2 ’ 


SO we have 


A'(//ii/ii.A...c'q < 


C'll/ll 


k II/.IP 


dt. 


(3.19) 


Because of scale invariance, the quantity H satishes 


H 


1 




5^A'(/,/i„,C<''), 


2=1 
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where the second expectation is taken over a truncated Gaussian (that only draws 
systems / with ||/|| < \/2N) with density function given by 

1 0 otherwise. 


Here (p is the density function of the standard Gaussian on 'H(rf) and p : = 
Prob{||/|| < V2N}. Then, using (3.19), 


H < VmC 





From Lemma 8 we have 

^ c' 

D3/2y]V/i2(^,0 

for a constant d (different from, but close to, c). We thus have proved that there 
are constants C", d such that 


H 


^ ^ J _^_ 


w 


= C'y/N 


E 




vw 

(d) 




(9,0 



dt. 


The 2N in the cut-off for the truncated Gaussian is the expectation of ||/|P for 
a standard Gaussian / G 'H(d). Since this expectation is not smaller than the 
median of ||/|p (see [11, Gor. 6]) we have ^ < 2. Using this inequality and the 
fact that the random variable we are taking the expectation of is nonnegative, 
we deduce that 


H < 2C"\/N E 


dlvift) 

ur 


dt 


< 2C"\fN 


dpavift) ^ 
E —— dt. 


53/2vWm(9,02 11-^*1 


We next bound the expectation in the right-hand side using Theorem 2 and the 
fact that ft ~ ,^((1 — t)g,tHd) and obtain 


H < 2nC"VND^/^ [ \dt 

j ^ c.' V 

IWvWfFo 

< C'"D^nNp\gX). 


(3.20) 
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with C" yet another constant. 

Having reached thus far, the major obstacle we face is that the quantity i?, 
for which we derived the bound (3.20), is an average over all initial zeros of 
(as well as over /). For this obstacle, none of the two solutions below is fully 
satisfactory, but each of them is so for a broad choice of pairs (n, D\ 

Case 1: D > n. Consider any g E E>, ( a well-posed zero of g, and let 
..., be the zeros of Note that when / is Gaussian, these are V 
different zeros almost surely. Clearly, 

V 

E K{f,gX) < 1+ E y^K{f,hs„C^^) 

= 1 + VH = OiVD^Nnu'^igX)) 

the last by (3.20). We now take as initial pair {g,C) the pair {g,eo) where g = 
is given by 

9i = + ■ ■ ■ + for i = 1,..., n 

(the scaling factor guaranteeing that ||^|| = 1) and cq = (1, 0,..., 0) G It 

is known that /i(^, Cq) = 1 (see [10, Rem. 16.18]) (and that all other zeros of g 
are ill-posed, but this is not relevant for our argument). Replacing this equality 
in the bound above we obtain 

E K{f,g,eo) = 0{VD^Nn), (3.21) 

which implies an average cost of 0{T>D^N^n) since the number of operations at 
each iteration of the homotopy algorithm is 0{N) (see [10, Proposition 16.32]). 

For any e > 0 this quantity is polynomially bounded in N provided D > 
and is bounded as jj jg ^j^g range [n, ([9, Lemma 11.1]). 

Case 2: D < n. The occurrence of V makes the bound in (3.21) too large 
when D is small. In this case, we consider the initial pair [U, zi) where U G ^{d) 
is given by 

u, = -X(XS‘ - A'*), = ^(At" - Xi"), 

v2n v2n 

(the scaling factor guaranteeing that ||[/|| = 1) and zi = (1,1,..., 1). We denote 
by zi,..., zx) the zeros of U. 

The reason for this choice is a strong presence of symmetries. These symme¬ 
tries guarantee that, for all 1 < < "D, 




(3.22) 
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and, consequently, that the value of si is the same for all the zeros of U. Hence, 


E K{f,U,z,) 


1 

V 


V E 


K{f,U,z,) 


E 




E 






(3.23) 


That is, the average (w.r.t. /) number of homotopy steps with initial system U 
is the same no matter whether the zero of U is taken at random or set to be zi. 
Also, 

/i2(I7,zi) < 2(n + l)^ (3.24) 

(actually such bound holds for all zeros of U but, again, this is not relevant for 
our argument). Both (3.22) and (3.24) are proved in [9, Section 10.2]. It follows 
from (3.23), (3.20), and (3.24) that 

E K{f,U,zi) = 0{D^Nn^+^). (3.25) 

/S'H(d) 


As above, for any hxed e > 0 this bound is polynomial in N provided D < 
and is bounded by jj ^ [ni+?,n]. □ 


Lemma 8 With the notations of the proof of Theorem 3 we have 


U = 


sinacot(sia) - ||/|| cosa + 1 p?{gXy 


d a constant. 

Proof. The formula for ti is shown in 

ll/ll sinQ;cot(siQ;) — ll/ll cosa + 1 < 

< 

< 

< 


[9, Prop. 5.2]. For the bound, we have 


ll/ll sina(sia) ^ + ||/|| + 1 

+1 

Si 


VNDd^fi\g,0 


+ 1 + 



for an appropriately chosen d. 


□ 
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